A CENTRAL LIMIT THEOREM FOR THE DETERMINANT OF 

A WIGNER MATRIX 



TERENCE TAG AND VAN VU 



Abstract. We establish a central limit theorem for the log-determinant log | det(M„)| 
of a Wigner matrix Mn , under the assumption of four matching moments with 
either the GUE or GGE ensemble. More specifically, we show that this log- 
determinant is asymptotically distributed like A'^{log %/n\ — i log n, i log n)^ 
when one matches moments with GUE, and N{log\/n\ — j log n, ;j log ri)^ 
when one matches moments with GOE. 
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1. Introduction 



Random matrix theory is an important subject in mathematics with appUcations 
to various areas such as numerical analysis, mathematical physics, statistics, num- 
ber theory and computer science, to mention a few. One of the main goals of 
this theory, by and large, is to understand the distribution of various interesting 
functionals of a random matrix that naturally arise from linear algebra. 

One of most natural and important matrix functionals is the determinant. As 
such, the study of determinants of random matrices has a long and rich history. 
The earlier papers on this study focused on the determinant det An of the non- 
Hermitian iid model An, where the entries Qj of the matrix were independent 
random variables with mean and variance 1. The earliest paper we find here 
belongs to Szekeres and Turan [39j , in which they studied an extremal problem. In 
the 1950s, there were a series of papers [16l [35l |48l [36] devoted to the computation of 
moments of fixed orders of of the determinant (see also pQ]). The explicit formula 
for higher moments get very complicated and in general not available, except in 
cases when the atom variables have some special distribution (see, for instance [9 ) . 

One can use the estimate for the moments and the Chebyshev inequality to obtain 
an upper bound on the magnitude | detyl„| of the determinant. However, no lower 
bound was known for a long time. In particular, Erdos asked whether det An is 
non-zero with probability tending to one. In 1967, Komlos ^25, i26j addressed this 
question, proving that almost surely | det An \ > for random Bernoulli matrices 
(where the atom variables are iid Bernoulli, taking values ±1 with probability 1/2). 
His method also works for much more general models. Following [25j, the upper 
bound on the probability that det An = has been improved in [24j |45l |46l [6] . 
However, these results do not say much about the value of | det An\ itself. 

A few years ago, the authors [IS] managed to prove that for Bernoulli random 
matrices, with probability tending to one (as n tends to infinity) 

(1) Vnlexp{~c\/n\ogn) < |det^„| < VrJ.uj{n) 

for any function uj{n) tending to infinity with n. This shows that almost surely, 
log I det A„| is (i -I- o(l))nlogn, but does not otherwise provide much information 
on the limiting distribution of the log determinant. For related works concerning 
other models of random matrices, we refer to |38] . 

In [21], Goodman considered random Gaussian matrices An = {Cij)i<i,j<n where 
the atom variables are iid standard real Gaussian variables, C,ij = N{0, 1)r. He 
noticed that in this case the square of the determinant can be expressed as the 
product of independent chi-square variables. Therefore, its logarithm is the sum 
of independent variables and thus one expects a central limit theorem to hold. In 




Here and in the sequel, — > denotes convergence in distribution. 
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where -/V(0, 1)r denotes the law of the real Gaussian with mean and variance 1; 
see e.g. |37] for a proof. Informally, we may write this law as 

(3) |det^„| «n-i/2V^exp(Ar(0,ilogn)R). 

We remark that because the second moment of exp(A^(0, t)R) is e^* for any t > 0, 
this law is consistent with the second moment identity 

(4) E|detA.|' = n!, 

for iid matrices (and in particular, for Gaussian matrices) that was first observed 
by Turan [481 , and easily derivable from the Leibniz expansion 

n 

(5) det An= ^ sgn(CT) Y[ Qaii) 

ctGS„ 4=1 

after observing that the terms on the right-hand side are pairwise uncorrelated in 
the iid case. 

A similar analysis (but with the real chi distribution replaced by a complex chi 
distribution) also works for complex Gaussian matrices, in which Qj remain jointly 
independent but now have the distribution of the complex Gaussian A^(0, l)c (or 
equivalently, the real and imaginary parts of Qj are independent and have the 
distribution of A''(0, |)r). In that case, one has a slightly different law 

(6) log{\detA„\)^llognl + llogn ^ 

y i log n 

or more informally 

(7) |detA„| «n-i/4V^exp(Ar(0,ilogn)M). 
Again, this remains consistent with Q. 

We turn now to more general real iid matrices, in which the Qj are jointly indepen- 
dent and real with mean zero and variance one. In [TB], Girko stated that holds 
for such random matrices under the additional assumption that the fourth moment 
of the atom variables is 3. Twenty years later, he claimed a much stronger result 
which replaced the above assumption by the assumption that the atom variables 
have bounded (4 -|- (5)-th moment [19j . However, there are several points which are 
not clear in these papers. Recently, Nguyen and the second author |34j gave a new 
proof for ^ under an exponential decay hypothesis on the entries. Their approach 
also results in an estimate for the rate of convergence and is easily extended to 
handle to complex case. 

The analysis of the above random determinants relies crucially on the fact that 
the rows of the matrix are jointly independent. This independence no longer holds 
for Hermitian random matrix models, which makes the analysis of determinants of 
Hermitian random matrices more challenging. The Hermitian version of Komlos' 
result [25l [26] was posed as an open question by Weiss in the 1980s and was solved 
only five years ago |23j and for this purpose the authors needed to introduce the 
quadratic analogue of Littlewood-Offord-Erdos theorem. The analogue of ID) was 
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first proved in |42l Theorem 31], as a corollar}!^ of tire Four Moment tfieorem. But 
mucli as in tfie situation in tire non-Hermitian case, ttiese proofs do not reveal much 
information about the limiting distribution of the determinant. 

Let us now narrow down our consideration to the foUowing class of random ma- 
trices. 

Definition 1 (Wigner matrices). Let n > 1 be an integer. An n x n Wigner 
Hermitian matrix Mn is defined to be a random Hermitian n x n matrix M„ with 
upper triangular complex entries Cij and diagonal real entries Qi {1 < i < n) jointly 
independent, with mean zero and variance one for 1 < i < j < n, and mean zero 
and variance for 1 < i < n and some ct^ > independent of n. We refer to the 
distributions of the Qj as the atom distributions of A/„. 

We say that the Wigner matrix ensemble obeys Condition CI for some constant 
Co if one has 

for all 1 < i,j < n and some constant Ci independent of n. 

Example 2. The famous Gaussian Unitary Ensemble (GUE) is the special case of 
the Wigner ensemble in which the atom distributions are given by the complex 
Gaussian A^(0, l)c for 1 < i < j < n and the real Gaussian N{0, 1)11 for 1 < i = 
j < n, thus in this case = 1. At the opposite extreme, the complex Hermitian 
Bernoulli ensemble is an example of a discrete Wigner ensemble in which the atom 
distributions is equal to ±-^±-^V~l (with independent and uniform Bernoulli 
signs) for 1 < z < j < n, and equal to ±1 ioi I < i — j < n (so again = 1). 

Another important example is the Gaussian Orthogonal Ensemble (GOE) in which 
the atom distributions are given by N{0, 1)r for 1 < z < j < n and iV(0, 2)k for 
1 < 2 = J < n, thus (7^ = 2 in this case. Finally, the symmetric Bernoulli ensemble 
is an example in which Qj = ±1 for all 1 < i < j < n, with = 1. 

All of the above examples obey Condition CI for arbitrary Cq. 

We now consider the distribution of the determinant for Wigner matrices. We 
first make the observation that the first and second moments of the determinant 
are slightly different in the Wigner case than in the iid case: 

Theorem 3 (First and second moment). Let Mn be a Wigner matrix. 

• (First moment) When n is odd, then EdetM„ — 0. When n is even, one 
has 

EdetM„.(-l)"/^^-^. 



This theorem requires the atom variable has vanishing third moment, but one can remove this 
requirement using very recent estimates of Nguyen I33| and Vershynin I49| on the least singular 
value. 
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In particular, by Stirling 's formula one has 




• (Second moment) If Mn is drawn from GOE, then ha^ 
n^^^nl < E|detM„p < n^/^n!, 



while if Mn is instead drawn from GUE, then 



n^l'^nX < E|detM„|2 < n^l'^nX. 



Proof. See Appendix |^ More precise asymptotics for these moments in the GUE 
case were estabhshed by Brezin and Hikami [3 (see also [15], [T^, 00]), however we 
give an elementary and self-contained proof of the above results in the appendix. □ 

Even in the GUE case, it is highly non-trivial to prove an analogue of the central 
limit theorem ([6]); this was first achieved in [8 via a lengthy computation using 
the explicit formula for the joint distribution of the eigenvalues. Notice that the 
observation of Goodman does not apply due to the dependence between the rows 
and so it is not even clear why a central limit theorem must hold for the log- 
determinant. 

While it does not seem to be possible to express the log-determinant of GUE or 
GOE as a sum of independent random variables, in this paper we present a way to 
approximate the log-determinant as a sum of weakly dependent terms, based oiQ 
analysing a tridiagonal form of both GUE and GOE due to Trotter [47]. Using 
stochastic calculus and the martingale central limit theorem (see Section [2]), we 
give a new proof of the following result. 

Theorem 4 (Central limit theorem for log-determinant of GUE and GOE). Let 
Mn be drawn from GUE. Then 



log|det(M„)| 



i log n! + i log n 



N{0,1) 




Similarly, if Mn is drawn from GOE rather than GUE, one has 



log I det(M„)| - ^ log n! + \logn 




%/log n 



Informally, this theorem asserts that 



|detAf„| « n-^/^V^.eMN{0, -\ogn)M) 



for GUE, and 



|detM„| 



n 



i/'^v^exp(A^(0,logn)K) 



''See Section 1 1.1 1 for the asymptotic notation we will use, including Vinogradov's notation 
■^We would like to thank R. Killip for suggesting the use of Trotter's form. 
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for GOE (compare with ([3]), ([7])). Note also that these distributions are consistent 
with the moment computations in Theorem [3l 

As mentioned previously, Theorem |4] has also been proven (using the explicit joint 
density distribution of the GUE and GOE eigenvalues) by Delannay and Le Caer [8] . 
However our approach is quite different in nature and somewhat less computational, 
and may be of independent interest. 

The next task is to extend beyond the GUE or GOE case. Our main tool for this 
is a four moment theorem for log-determinants of Wigner matrices, analogous to 
the four moment theorems for eigenvalues [42], [41], [43], Green's functions [13], 
and eigenvectors [44], [29]. Let us say that two Wigner matrices M„ = {Cij)i<i,j<n 
and — (Cij)i<ij<n match to order m off the diagonal and to order k on the 
diagonal if one has 

E(ReC.,)'^(ImC.,)'' - E(ReC',r (ImC,)" 

for all 1 < i < j < n and natural numbers a, 6 > with a + b < m for i < j and 
a + b < k for i = j. 

Theorem 5 (Four moment theorem for determinant). Let Af„, Af^ be Wigner ma- 
trices whose atom distributions have independent real and imaginary parts that 
match to fourth order off the diagonal and to second order on the diagonal, are 
bounded in magnitude by n"^" for some sufficiently small but fixed cq > 0, and are 
supported on at least three points. Let G : R — )■ M obey the derivative estimates 

(8) \£-G{x)\=Oin^'') 

for < j < 5. Let zq — E + \J— Ir/o be a complex number with \E\ < 2 — 6 for some 
fixed S > 0. Then 

EG(log I det(M„ - V^zo)\) - EG(log | det(M; - V^zo)|) - 0{n-') 

for some fixed c > 0, adopting the convention that G{—oo) = 0. 

If E ~ 0, then the requirement that the real and imaginary parts of the atom 
distribution are supported on at least three points can be dropped. 

We prove this theorem in Section 21 following a preparation in Section [3] The 
requirements that M„, be supported on at least three points, and that E lie in 
the bulk region \E\ < 2 — 6 are artificial, due to the state of current literature on 
level repulsion estimates (see ProDOsition[T4|. It is likely that with further progress 
on those estimates that these hypotheses can be removed. The hypothesis that 
the atom distributions have independent real and imaginary parts is mostly for 
notational convenience and can also be removed with some additional effort. The 
hypothesis that the entries are bounded in magnitude by n'^" is, strictly speaking, 
not satisfied for distributions such as the Gaussian distribution, but in practice we 
will be able to reduce to this case by a truncation argument. 

By combining Theorem [5] with Theorem U] we obtain 



A CENTRAL LIMIT THEOREM FOR THE DETERMINANT OF A WIGNER MATRIX 7 



Corollary 6 (Central limit theorem for log-determinant of Wigner matrices). Let 

Mn be a Wigner matrix whose atom distributions are independent of n, have 
real and imaginary parts that are independent and match GUE to fourth order, and 
obey Condition Clfor some sufficiently large absolute constant Cq. Then 

log|det(M„)|- llogn! + |logn ^ 

% log n 

If Mn matches GOE instead of GUE, then one instead has 

log|det(M„)|-llogn! + |logn ^ 
^/Iogn 



The deduction of this corollary from Theorem[5]and Theorem|4]is standard (closely 
analogous, for instance, to the proof of '42, Corollary 21], which establishes a similar 
central limit theorem for individual eigenvalues of a Wigner matrix) and is omitted. 
(Notice that in order for the atom variables of M„ match those of GUE to fourth 
order, these variables must have at least three points in their supports.) 



1.1. Notation. Throughout this paper, n is a natural number parameter going 
off to infinity; in particular we will assume that n > 100 (so that log log log n is 
well-defined). A quantity is said to be fixed if it does not depend on n. We write 
X = 0{Y), X < y, or y > X if one has \X\ < CY for some fixed C, and 
X ~ o{Y) if one has X/Y — > as n — J' oo. Absolute constants such as Co or cq are 
always understood to be fixed. 

We say that an event E occurs with high probability if it occurs with probability 
1 — 0(n^^) for some fixed c > 0, and with overwhelming probability if it occurs with 
probability 1 - 0{n~^) for all fixed A> 0. 



1.2. Acknowledgments. We thank Brad Rodgers and Zhigang Bao for references, 
Peter Eichelsbacher, Xiuyuan Cheng, and the anonymous referee for corrections, 
and Rowan Killip for suggesting and explaining the tridiagonal method. 



2. Central limit theorem for GUE 



We now prove Theorem |4l For notational reasons we shall take n to be even, but 
the argument below can easily be verified to also work with minor modifications 
when n is odd. We will use a method suggested to us by Rowan Killip (private 
communication), and loosely based on the arguments in |28) . 

We will work for most of this section with the GUE case, and discuss the changes 
in the numerology needed to address the GOE case at the end of the section. 

The starting point is the following beautiful observation of Trotter ^47^ : 
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Proposition 7 (Tridiagonal form of GUE). [47] Let M'^ he the random tridiagonal 
real symmetric matrix 



/ ai 


bi 


. 







bi 


0,2 


b2 ■ 











b2 


as ■ 














. 


a„„i 




Vo 





. 


■ bn^i 


an J 



where the ai, . . . , a„, 6i, • . . , bn-i are jointly independent real random variables, with 
ai, . . . , a„ = -/V(0, 1)m being standard real Gaussians, and each bi having a complex 
X- distribution: 

where Zij = N(0, l)c are iid complex Gaussian^. Let Mn be drawn from GUE. 
Then the joint eigenvalue distribution of Mn is identical to the joint eigenvalue 
distribution of M^. 



Proof. Let M„ be drawn from GUE. We can write 



UMJJ* 



where Mn-i is drawn from the n — 1 x n — 1 GUE, a„ = A''(0, 1)r, and Xn € 
C"~^ is a random Gaussian vector with aU entries iid with distribution N{0, l)c- 
Furthermore, M„_i,X„,a„ are jointly independent. 

We now apply the tridiagonal matrix algorithm. Let bn-i ■= |X„|, then 5„ has the 
X-distribution indicated in the proposition. We then conjugate Mn by a unitary 
matrix U that preserves the final basis vector e„, and maps Xn to 6„_ie„_i. Then 
we have 

Mn-i 5„_ie„_i 

•bn-lCn-l a„ 

where M„_i is conjugate to A/„_i. Now we make the crucial observation: because 
Mn-i is distributed according to GUE (which is a unitarily invariant ensemble), 
and 17 is a unitary matrix independent of M„_i, M^-i is also distributed according 
to GUE, and remains independent of both and a„. 

We continue this process, expanding C/M„J7* as 

'Mn-2 Xn-1 
Xn-i On-l bn-1 

bn-1 a„ 

Applying a further unitary conjugation that fixes e„_i,e„ but maps Xn-i to 
bn-2Gn-2, wc may replace Xn~i by bn-2en-2 while transforming Mn-2 to another 
GUE matrix Mn-2 independent of a„, 6„_i, a„_i, 6„_2. Iterating this process, we 



°In other words, the real and imaginary parts of 2^ j are independent with distribution 
A^(0,1/2)k. 
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eventually obtain a coupling of M„ to by unitary conjugations, and the claim 
follows. □ 



In what follows, we are going to prove the limit law for the model and hence 
for Mn- Since has expectation i and varianc60 i, we can write it as 

= i + ViCi, 

where Ci has mean and variance 1. 

By the properties of normal distribution and chi square distribution (or from 
concentration of measure inequalities), we have the following tail bound. There are 
constants Ci, C2 > such that for alH > 1 and t >0, one has 

(9) P(max{|a,|, |c,|} > t) < Ci exp(-t^^). 

Let denote the upper left i x i minor of M^, and write Di := detM/. From 
cofactor expansion we have the recursion 

for alH > 2. To prove Theorem 21 we need to establish the law 
log|D„|-ilog (n-l)!-ilog7 



5 log n 



A^(0,l)i 



It will be convenient to skip the first few terms of this recursion. Let m be a 
sufficiently slowly growing integer- valued function of n (e.g. m :— [log log log nj 
will suffice); we will only apply this recursion for i > m. Notice that Di ^ Q with 
probability one for all i. 

We then have 



Di = GiDi^i - (i - 1 + \/i - lci_i)A-2. 
To mostly eliminate the i — 1 factor, we introduce the normalised determinants 

D, 



and conclude the recurrence 



di 



E, = -^E,^i - + ^\E, 



By Taylor expansion we can rewrite this as 

(10) i.. = ^i..-.-(i + ^-i + o(^))i.._.. 

Our task is now to show that 

. . log \En \ + T log^l 

(11) ' " ^ -^jV(0,l)R. 

5 log n 



^Note that the more familiar real chi squared distribution x? would have variance 2i here, but 
fe| has the complex chi squared distribution which has variance i. 
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To deduce this central limit theorem from (fTO|) , we would like to write log \ Ei \ as a 
sum of martingale differences. But it is rather hard to do from the above recursive 
formula (1101) . We will need to perform an additional algebraic manipulation to 
obtain a more tractable formula involving the closely related quantity Fj := + 
In particular, we will establish 



ave 



Proposition 8 (Central limit theorem for F„/2)- W^e h, 

, N log-F„/2 + \ logn 

(12) n/22 

V 2 log n 

We now prove this proposition. The idea is use Taylor expansions (which can be 
viewed as a discrete version of Ito's stochastic calculus) to approximate logF„/2 + 
ilogn as the sum "Tj'^i martingale differences, to which the martingale 

central limit theorem may be applied. 

We turn to the details. From ([T0| for i = 2j, 2j — 1 we first observe the crude 
bound 

(13) F.^OiYf^'^F,^,)^ 

where Y, 1 + |a2j | + |c2j-i| + |a2j-i| + |c2j-2|- Observe from that 

(14) Ey/ < 1 
for any fixed I. 

Next, we apply (fTO)) for i ~ 2j, 2j — 1 and use Taylor expansion (using ([13]) to 
bound error terms of order j^"^^^ or better) to obtain 

" 7%^^^"^ ~ ^ " i^)^2i-2 + O 

p _ .a2j--J_ C2j-2 i -J ^p 

where are random variables bounded in magnitude by 0(Y^^^^) and with 

mean zero. (In fact, we can obtain a denominator of instead of j^/^ in the error 
terms here, although this improved error term will not persist in our later analysis.) 
Substituting the second equation into the first (and again using (fT3|) to handle all 
terms of order or better), we also obtain 

E2j = -{1 + 



where rj'^' , r^^' obey the same sort of bounds as r^^' , rj^' . We may rewrite these 
estimates in matrix form as 
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where Gj is the near-Gaussian matrix 



(16) G. 



-C2J-1 -a2j 

and Rj is a random matrix depending on j, a2j-i, a2j, C2j-2, C2j-i with mean zero 
and whose entries are bounded by 0(|K,|'^^^-'). (We remind the reader at this point 
that the imphed constants in the 0{) notation are independent oi j.) 



E2j 



Using ((T5|) . we can express 



We can collect some terms, splitting G*Gj as the sum of 2 and the mean zero 
random matrix G*Gj — 2, and obtain the expansiorQ 

(17) F, -- 

where 

Eo 




— 2 

and 



i?2j-2 £'2j-3) G* 



and i?^ is a random matrix depending on j,a2j-i,a2j,C2j-2,C2j-i with mean zero 
and entries bounded by 0(Y^^''^^). 

We can expand hj as 
(18) 

/ \ ^2j-2 , / N ^2j-3 , I N E2j-2E2j-'i 

\-C2j-i)-^ h {-C2j-2)-^ — -^^2 ^ {^23-1 - a2j j "^^a ■ 

-^2j"-2 ' -^2j"-3 2j"-2 ^ -^2j"-3 2j-2 ~^ ^2j-3 

As the c/, a/ are independent and all have mean and variance 1, we conclude that 
for any fixed i?2j-2 and i?2j-3, hj also has mean zero and variance 1, thus 

(19) Eihj\Sj-i) = 0; ^ h 

where £i is the cr-algebra generated by the random variables oi , . . . , 02! and Ci , . . . , C2/-1 
(or equivalently, by the entries of the minor M2/). Similarly, for any fixed choice of 
E2j-2, £'2j-3, kj is a real random variable with mean zero, and thus 

(20) E(fc,|£,_i)=0. 
Also, from construction, hj, kj — 0{Y^''-^''). 



'^The term here arises from combining three contributions —Ix^ + ^x (— 1)H — h=s X — x 2. 
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Taking logarithms in ([T7| . we obtain 



V2, ,1, . ^ Y. 



Oil)' 



\ogFj = logFj_i + log 1 + — + -y=hj + -kj + O 



By telescoping series, we may thus write 

n/2 / 

log F„/2 - log F„, + J2 log I 1 + + O 



j3/2 



where 

1 1. 

— + + -i 

For m sufficiently slowly growing in n, we clearly have 



(21) Xj T- + -^hj + -kj. 



log F,n = o(Vlogn) 

with probability 1 — o(l), since Fm is almost surely finite with a law that depends 
only on m and not on n. To prove (1121) . it thus suffices to show that 

J:;LI+i log f 1 + x,+0 (^)) + i log n 

(22) ^ =^ >N{0,l)u. 

V2 log n 

The next step is to use Taylor expansion to approximate the logarithm to extract 
something that more closely resembles a martingale difference. Observe that Xj — 
0(y.°^^Vj^^^)- From (HH), we conclude that with probability 1 - 0{j-^°°) (say), 



the expression I + xj + 0{-rjj2-) lies between 1/2 and 3/2 (say). From the union 
bound, we thus see that with probability 



one has 



1- E oim-i-o{i), 

j—m-\-l 

log (^1 + x,+0 (^^^ ^=x,- xy2 + O (^^^ 

for all m + 1 < j < n/2. As hj has variance 1, we can split h'j as the sum of 1 and 
the mean zero random variable ^ 1. From (|2ip we may thus expand 

where kj is a random variable bounded by 0{Y^^'^^) which has conditional mean 
zero: 



E(fc;|5,_i) = 0. 

Similarly, from (|14p and the union bound again, the 0{Y^^^'^^ /j^^^) error terms 
are 0{l/j^'^) (say) with probability 1 — o(l), and thus we see that with probability 
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0(1), we have 



^ , A fy^'W ^ V2, , 1,, 1 



(23) log ^1 + X, + ^-j^ j j = ^h, + -fc;. - - log n + 0(1). 
To prove (P^ . it thus suffices to show that 

(24) ^^-";,f ^ " ^^mi)«. 

V2 log n 

Observe that as each k'j are martingale differences, which have variance 0(1) thanks 

to (fT4l) . As such, the expression f^j ^^^^ variance X]j=ni+i ^(^/i^) ^ 

o(log n) and can thus be discarded. If m is small enough, then the expression 
Sjli variance o(logn) and can similarly be discarded. Thus it suffices 

to show that 

y^n/2 1 ^ 

(25) ^'=;^ ' ->iV(0,l)«. 

In order to verify (|25p. we need to invoke the martingale central limit theorem: 

Theorem 9 (Martingale central limit theorem). Assume that Ti, . . . ,T„ are mar- 
tingale differences with respect to the nested a-algebra £o,£i, ■ . ■ ,£n~i- Let vf-^ :— 
^(T^\£^-l), and 4, := J^^^^i ^{T^)- Assume that 

• Vn/sn — >■ 1 m probability; 

• (Lindeberg condition) for every e> 0, s^'^ Y!h=o '^i'^i+i'^Ti-^i>esn) — >■ as 
TO — >■ oo. 

Then ^ iV(0, !)». 

Proo/. See [1 Theorem 1]. □ 

We apply this theorem with Tj :— -^hj. From ([T9|) one has E(Tj+i|fj) = and 

E(T/+J£:j) = i, and hence also E(r/+i) = j. Thus t;„ = s„ = log^/^ n + 0(l); this 
gives the first hypothesis in Theorem |9l 

Now we verify the Lindeberg condition. From (fT4| we have F,{Tf_^-^^\£i) = ■jj^j^'E{hfj^i\£i) ^ 
-k and hence 



E(I^2^i1t.+,>.sJ < e-^s-^ET^, « 
since 5n — log""^^^ ?^+0(l), the claim follows. This concludes the proof of Proposition 

m 

Proposition [8] controls the magnitude of the vector ( J^'^^ ] when j = n/2. We 
will however be interested in the distribution of E,-, , and so we must also obtain 
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where Dj is the matrix 



information about the phase of this vector also. To this end, we express this vector 
in polar coordinates as 

for some 9j G R/27rZ, where we introduce the sign (—1)-' to cancel the —1 factor in 

m- 

Proposition 10 (Uniform distribution of 6'„). One has 

On u 

as n ^ oo, where u is the uniform distribution on M/27rZ. 



Proof. By the Weyl equidistribution criterion, it suffices to show that 

for every fixed non-zero integer k. 
Fix k. Inserting the polar representation (j26p into (|15p . we obtain the recursion 

(^^) (:m::)-^(^--^)(:mt;! 

1 1 r.^^'^ 

where i?" is a matrix obeying the same properties as Rj or R'j , and Cj is a non-zero 
scalar whose exact value is not important for us. 

We extract the components ej , fj of Dj in in the orthonormal basis formed by the 
two vectors (cos0j_i,sin0j_i) and (— sin(?j_i,cos0j_i), thus 

ej (- sin^j_i, cos^^.i) {^^^^^^^l 

and 

fj — (cos0j_i, sin6lj_i) 

From (|27p we thus have 

-i-(cos6'j, sin^j) = (1 -t- /j)(cos ^j-i, sin6'j_i) -I- ej(— sin6'j_i, cos0j_i), 

and so we have a right-angled triangle with base 1+/^, height Cj, and angle 
Elementary trigonometry then gives 

tan(0,-0,_i) = ^^. 



-0.49 

I — "^U 

(say). Using the Taylor expansion of arc tan and 1/(1 + x) we obtain 



By p^ . we see that with probability l~0{j ^°°) (say), we have ej,fj — 0[j ''■^^) 
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and thus 



2 



with probabihty 1 — 0{j ). Hence 



k' 



2 



Ee'^^^^ = Ee*'=^^-i (1 + zfce^ - ike^fj - y e^) + 0(r '■*^)- 

Now from ([^5]). ([TS]). we see that after conditionmg on 0j_i, has mean 
0(j'~^-^^) and variance ^ + 0(j~"'^'^^), and that e^/j has mean 0(j^^'^^). We 
conclude that 

for any I < j < n. Telescoping this, we see that 

n 

for any 1 < m < n. Bounding lEe**"'^™! by 1 and choosing m to be a slowly growing 
function of n, we obtain the claim. □ 



From the above proposition, we see in particular that 

T-— < Icos^n/al < 1 
logn ' 

(say) with probability 1 — o(l). Since £'„ — (^l)"^'^-^,l/2 cos6'„/2, we thus have 
log|£;„| = ilogF„/2 + 0(loglogn) 

with probability 1 — o(l); combining this with Proposition [HI we see that 

log|g„P + llogn 

^ iV(U, IjR. 

V2 log n 

The claim (ITT]) then follows. 

2.1. The GOE case. We now discuss the changes to the above argument needed 
to address the GOE case. The analogue of Proposition [7] is easily established, 
but with the changes that the aj now have the distribution of A^(0, 2)k instead of 
A'^(0, 1)r, and the bj now have a real ^-distribution instead of a complex one (thus 
the Zij are now distributed according to A''(0, 1)r instead of N{0, l)c)- The effect 
of this is to make the random variables aj , Cj in the above analysis have variance 2 
instead of 1 (but they still have mean zero). As a consequence G*G now has mean 
4 rather than mean 2, which means that the ^ term in p7)) becomes jr. On the 
other hand, the random variables hj now have variance 2 instead of 1. These two 
changes cancel each other out to some extent, and in particular the assertion 
remains unchanged. Finally, when applying the martingale central limit theorem, 
the variances w^, are now 21ogn + 0(1) rather than logn + 0(1), again thanks 
to the increased variance of hj . The remainder of the argument goes through with 
the obvious changes. 
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3. Resolvent swapping: a deterministic analysis 



In this section we study the stabihty of Hermitian matrices with respect to per- 
turbation in just one or two entries. To formahse this we wih need some definitions. 

We will need a number of matrix norms. Let A = {aij)i<i,j<n be a matrix, and 
let 1 < p, g < 00 be exponents. We use ||^||(g,p) to denote the £p — i"^ operator 
norm, i.e. the best constant in the inequality 

\\Ax\\i. < \\A\\^q,p)\\x\U. 

Thus for instance \\A\\ (2,2) is the usual operator norm. We also observe the identities 



mil (00,1) = sup lay I 

l<i,j<n 



and 



||A||(^,2):= sup (^|a,,f)i/^. 

In particular one has the identity 

(29) \\A\\^^^2) = \\AA*\\l/^^^y 
By duality one has 

(30) \\Mi,,p) = \\A*\\ip,,,'), 
where 1/p+l/p' = 1/q+l/q' = 1. 

We observe the trivial inequality 

(31) Pi3||(,,p)<||^||(,,g)||i?||(,,p) 
for any A, B and 1 <p,q,r < 00. 

Next, we need the notion of an elementary matrix. 

Definition 11 (Elementary matrix). An elementary matrix is a matrix which has 
one of the following forms 

(32) V = Cadi, e-aP*b + etel, V^eacl - V^e^e* 

with 1 < a,b < n distinct, where ei, . . . , e„ is the standard basis of C". 

Observe that 

(33) \n,,p) « 1 

and 

(34) I trace(Ay)| = j trace(y^)| = 0(||A||(g,p)) 
for all 1 < p, g < 00 and all n x n matrices A. 

Let Mq be a Hermitian matrix, let z = i? + «?7 be a complex number, and let V be 
an elementary matrix. We then introduce, for each t the Hermitian matrices 

Mt := Mo + -^tV, 
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the resolvent 

(35) Rt ^ Rt{E + 17]) := (Mt - E - n^)-' 
and the Stieltjes transform 

St := st(E + if]) — trace Rt(E + i-q) 
n 

and study how Rt and St depend on t. 
We have the fundamental resolvent identity 

Rt = Rq i=RQVRt 

which upon iteration leads to 

(36) Rt = Ro+J2 {-^y {RoV)^Ro + (-^) ^ iRoV)''+'Rt 



Under a mild hypothesis, we also have the infinite limit of (|36p : 

Lemma 12 (Neumann series). Let Mq be a Hermitian n x n matrix, let E G 
77 > 0, and f € M, and let V be an elementary matrix. Suppose one has 

(37) |i|||-Ro||(oo,i) =o(V^). 

Then one has the Neumann series formula 



(38) 



t 



Rt^Ro + J2[-^j [R^vyRo 



with the right-hand side being absolutely convergent, where Rt is defined by (|35p. 
Furthermore, for any 1 < p < 00 one has 

(39) \\Rt\\(oo,p) < (1 + O(l))||i?o||(oo.rt- 

In practice, we will have t = n'^^'^"^ (from a decay hypothesis on the atom distribu- 
tion) and ||i?o|| (00,1) — n'^^'^°'> (from eigenvector delocalisation and a level repulsion 
hypothesis), where cq > is a small constant, so ([57)1 is quite a mild condition. We 
also remark that by replacing Mq and t with Mq + tV and —t respectively, one can 
swap the roles of Rq and Rt in the above lemma without difficulty. 

Proof. We rewrite ([55)1 as 

(40) - (-^)'^' iRoV)'+'^ R^ = R„ + J2(^-l^y (R,V)^R, 

for all fc > 0. Sending fc — > 00 we will be able to conclude ([55]) (in a conditionally 

/ s fe + l 

convergent sense, at least) once we show that f — -^j (RqV)'^'^^ converges to 
zero (in, say, ||^||(oo,i) norm) as fc — > 00. But from p3p . (1311) we have 



t 



fc+i 



<0[\^^\\R,t^\ 



(00,1) 
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From ([37| . this decays exponentially in k, and this gives (p8|) (and also demonstrates 
that the series is absolutely convergent). 

Taking (oo,p) norms of (|38)) . one has 

\\Rt\\ioo,p) < l|-Ro||(oo,p)+^ ( ||(i?0'^Fi?0||(oo,p). 

But from ^ one has 

\\{RoVyRo\\ioo,p) < (||i?0||oc4ll"t^l|l,oo||)-''||i?0|lco,p = O(||i?0||oo,l)''|1^0||(oc,p) 

and the claim follows from ([57|. □ 

We now can describe the dependence of st on t: 

Proposition 13 (Taylor expansion of st). Let the notation be as above, and suppose 
that p7p holds. Let k >0 be fixed. Then one has 

k 

(41) s, = so+Y,n-^/^c,t^+o(n-(''+'^/'\t\'+^Ro\\l+\unn{^^^^^^ 

j=i V ' / 

where the coefficients Cj are independent of t and obey the bounds 

(42) c, = O f ||i?o||{^,i) min{||i?o||(oo,i). — }) ■ 
for all 1 < j < k. 

Proof. If we take normalised traces of ((36|) . we obtain 

St = so + ^n-^"/2cji^- + n-^^'+D/^i'^+Vf ) 
i=i 



where cj are the coefficients 



Cj := {-ly^tYaceiiRoVyRo) 



and Tf'^'' is the error term 



r^t''^ := (-l)'=+i-trace((i?o^)''+'i?t)- 



To estimate these coefficients, we use the cyclic property of trace to rearrange 
trace((i?ol^)^.Ro) = trace(l/(i?oV")^'"^-Ro) 
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and thus by dS]), dSI]), dM]) we have 



|c,|«i||(i?oV^y-^i?^||(oo,l) 

«^Po^ll(;o!oo)Poll(oo,l) 

« ^(l|i?o||(oo,l)||^^||(l.oo))^"'||i?o'll(oo.l) 

«^Po||{o,'i)P^ll(oo,l)). 



We can bound this in one of two ways. Firstly, by pip we have the crude inequahty 

||^^oll(oo,l) < ||-Ro||(oc,oc)||-Ro||(oo,l) < "-Il^o||(oo,l) 

(coming from the bound ||a;||£i < n||a;||^oc), leading to the bound 

c, = 0{\\Ro\\lt\))- 
Alternatively, we can use (|3T|) . ([30)) . (|29t to bound 

||^oll(oo,l) < ll^o||(oo,2)||^o||(2,l) 

< ll^o||(oo,2)ll^3ll(oo,2) 

< Poi?sii;£i)Ps^oii;i^). 

But from the definition ((35)) of the resolvent, one has the identity 

n n* _ n* p _ ^0 ~ -^0 
-Ko-Kq — ^0^0 — 77- 

and thus from the triangle inequality and (1301) 

ll^oll(oo,l) < -||-Ro||(oo,l)- 



This gives 



Combining the two bounds on Cj we have (|42p . A similar argument can be used to 
bound rl'^^ (using ([39)) to replace Rt by Rq at some stage of the argument), so that 

(43) rf ) = O (^||i?o||^i\)min{||i?o||(oo,i) A}) ■ 

The claim (gD) follows. □ 



4. Proof of Theorem [5] 

In this section we prove Theorem [5] Let M„,M4 be as in that theorem, with Cq 
sufficiently small to be chosen later. Call a statistic S{M) that can depend on a 
matrix Af highly insensitive if one has 

\S{M,,)-S{AQ\^0{n-') 
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for some fixed c > 0. Thus our task is to show that EG(log | det(M„ — v^-^q)!) 
is highly insensitive for all zq and all G obeying ([S]). By dividing G by n''° (and 
reducing the size of cq if necessary) we may improve ([S]) to the estimates 

(44) |£-G(.)| < 1 
for aU x e M and < j < 5. 

By truncating the atom distributions (and re-adjusting to keep them at mean 
zero and unit variance) and using Condition CI, we may assume without loss of 
generality that we have the uniform upper bound 

(45) Id < n'" 

on the atom distribution (see [5^ Chapter 2] or [34l Appendix A] for more details 
on the truncation technique). 

Set Wn -^Mn (and W;, ^M^). Then 

log I det(Af„ - y/nzo)\ ^ ^nlogn + log | det(W„ - zo)\. 

By translating G by ^nlogn (which does not affect the bounds (01])), it thus suffices 
to show that EG'(log | det(W„ — zo)\) is highly insensitive. 

Write zq = E + -\/— l?7o. By conjugation symmetry we may take rjo > 0. We first 
dispose of the easy case when rjo > n^'^^ . In this case we have 

log I det(W^„ - zo)\ =n\og \zo\ + log | det(l - z^^Wn)\ = nlog \zo\ + 0(n-5°) 

(say), thanks to (|45|) . The claim then follows easily in this case from (|44p . 



We now restrict to the main case < r^o < n^'^'^. From the fundamental theorem 
of calculus one has 



log |A - Zo\ = log |A - S - V^n^""! - Im / 

J rin 



drj 



X- E- V^r/ 



and hence 
(46) 



log I det(W„ - zo)| = log I det{Wn -E- ^/^n^^^)\ - nim / s{E + V^v) dr] 

J rin 



where 



s{z) = sw^{z) = itrace(W^„ — ^ 
n 



is the Stieltjes transform of Wn- 

The previous analysis and then gives 

log I det( W„ - zo) I = n log n^"° + 0{n-^°) ~ nlm s{E + V^rj) drj 
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By translating (and reflecting) G once more, it thus suffices to show that the quan- 
tity 



,100 



EG l^nlm j s{E + V^v) d^j 
is highly insensitive. 

We next need the following proposition. Let Ai(W„) > ... > A„(W„) denote 
the eigenvalues of Wn (counting multiphcity) , and let ui{Wn), ■ ■ ■ ,u„(T4^„) be an 
associated orthonormal basis of eigenvectors. 

Proposition 14 (Non-concentration). With high probability, one has 

min \X,{Wn) ~E\> n-'^-"" 

l<i<n 

and with overwhelming probability one has 

Ni^O{n\I\) 

whenever I is an interval of length \I\ > 77,^1+^^0 ^ sufficiently large constant 
A > 0. Also, with overwhelming probability one has 

sup \\u,{Wn)\Uo. <„rl/2+O(-0) 
l<i<n 



Proof. The second claim follows from [42, Proposition 66] and the third claim fol- 
lows from [42, Proposition 62], so we turn to the first claim. 

The results in [33j only give a lower bound of for some fixed C, which is 
not quite enough for our purposes. On the other hand, if the atom distribution is 
sufficiently smooth, the claim follows from existing level repulsion estimates such 
as [31] or [12], which are valid in the bulk region \E\ < 2 — S. To extend to the case 
when the real and imaginary parts of the atom distribution are supported on at 
least three points, one can use the Four Moment Theorem (see [42]) and a moment 
matching argument (see e.g. the proof of [42, Corollary 24]). We remark that these 
are the only places in which we use the hypotheses that \E\ < 2 — 5 and that the real 
and imaginary parts of the distribution are supported on at least three points. It 
is likely that by improving the results in the above cited literature, one can remove 
these hypothesefl □ 



As a consequence of Proposition 1141 we obtain a upper bound on the (imaginary 
part of the) Stieltjes transform: 

Corollary 15. For a sufficiently large constant > (independent of cq), one 
has 

(47) Ims(£; + \/^n-i~2Aoco) < 

with high probability. 



^For instance, the results in |49| do not need the support hypothesis, but require the energy 
E to be zero and the imaginary part to vanish. It may however be possible to remove these 
hypotheses from the results in | 49| . which could lead to an improvement of the proposition here. 
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Proof. The left-hand side of (|47]) can be written as 

n ^ 
2-2A0C0 \^ i 

By Proposition[T4l we assume with high probabihty that there are at most 0{n^'^°) 
eigenvalues Xi{Wn) that are within y^-i+'^co Qf j^^ i^^^ ^j^g^^ g^ch eigenvalues are at 
least n~^~'^° away from E. The total contribution of these eigenvalues to the above 
expression is then at most 0(n(~2'4o+A+2)co^_ Similarly, by using Proposition [HI 
and dyadic decomposition of the spectrum around E, we see that the contribution 
of the eigenvalues that are further than t^-i+'^co away are O(n*^~2^o-'4)co^ with 
overwhelming probability. Combining these bounds we obtain the claim if Aq is 
large enough. □ 



Let X : R — R be a smooth cutoff to the region \x\ < n~'^°'^° that equals 1 for 
|a;| < n~^°''° /2. From the above corollary, x(Ims(i!J + \/^^n~^~-^^"'^")) is equal to 
1 with high probability. Thus it suffices to show that 

(^100 \ 
nlm^^ s{E + V^rj) dr] \ x {lms{E + 

is highly insensitive. 



We now view as obtained from M„ by swapping operations, each of which 
either replaces a diagonal entry of M„ with the corresponding entry of M/^, or 
replaces the real or imaginary part of an off-diagonal entry of Af„ (and its ad- 
joint) with the corresponding entries of (leaving the other part of that entry 
unchanged). Of these swapping operations, n of them will involve a diagoanl 
entry, and the other — n will involve an off-diagonal entry. It will suffice to show 
that each swapping operation only affects pS)) by 0(n^2-c~) ^j^g off-diagonal 
case and 0{n^^^'^) in the diagonal case for some fixed c > 0. In fact we will ob- 
tain a bound of the form 0(n~^/2-i-C>(co)'j (where the implied constant may depend 
on A,Ao) in the off-diagonal case and 0(n~'^/2-i-0(co)^ j-^ ^j-^g diagonal case, which 
suffices for cq small enough. 

Let Mn ^ , Mn ^ be two adjacent matrices in this swapping process, and let Wn^^ , Wn^'' 
be the associated normalised matrices. Then we can write 



wi^'> = Wo + ^C^^V 

where V is an elementary matrix, are real random variables matching to 

fourth order and bounded in magnitude by 0(n'^^^''''), and Wq is a random matrix 
idnependent of both ^'"^ and We can then write (|48l) for Wn^^ using the 

notation of the preceding section as 

EG(nIm / s^(„, {E + ^^77) dr])x{luys^m {E + ^/^n-^-^^""")) 
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and we wish to show that this expression only changes by 0{n 5/2+o(co)-| -^^j^gj-^ ,^(0) 
is replaced by ^'-^^ in the ofF-diagonal case, or 0(n^''/2+C'(co)') ^j^g diagonal case. 

We now bound the resolvent: 

Lemma 16 (Resolvent bound). If x{s^{o>{E + in^^-'^^°''")) is non-vanishing, then 
with overwhelming probability 

sup||i?.(o)(S + \/^77)|l(^,i) < 

r;>0 

and 

(49) sup|li?o(i^ + "/)|l(oo,i) 

r]>0 

Proof. From spectral decomposition one has 



||i?4(ii; + V^77)ll(oo,i) <E 



^ \X,{W,,) - E - 

Applying the last statemeni0 in Proposition I14| we conclude with overwhelming 
probability that 

" 1 

||i.,(i. + V^,)||,^,,<n-^^"(-o)g ^^^^^^^_^^ . 

Arguing as in Corollary [T51 one see that if x{s^{E + ^/ —ln^^^^^°'^°)) is non- 
vanishing, then with overwhelming probability 



2-.\X,iW^)-E\ ' 

and the first claim follows. The second claim then follows from Lemma fT^ (swapping 
the roles of and ^(°)). □ 

We now condition to the event that ([^^ holds. To begin with, let us assume we 
are in the ofF-diagonal case. Then by Proposition we have 

s.iM{E+^r^) = so{E+V^7])+Y{^'-'^yn-^^^cJrj)+0{n-^^^+^^^°^)mm{l, —} 

for i = 0, 1, where the coefficients Cj enjoy the bounds 

Cj^oi n^^^^o^minll,— } I . 
\ nrj J 

From this and Taylor expansion above we see that the expression 

100 



G 



I nim J Sj(o {E + \^ri) drj j x (lms5(£' + V-Tn-^^ 



^To be precise, we need to apply this statement for wji''^ , but the proof for this matrix is the 
same as for Wn; see 1421 . 
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is equal to a polynomial of degree at most 4 in <^'^*) with coefficients independent 
of plus an error of 0(?i^^/^+'^*^'^''^). Taking the expectation and using the four 
moment assumption, we obtain that the difference between the expectations of G 
with respect to ^'"^ and ^^^^ is 0(n~^/^+'^('"')), as desired. 

In the diagonal case, one argues similarly, except that one only is assuming two 
matching moments, and so one should only Taylor expand to second order rather 
than fourth order. This concludes the proof of Theorem [5] 

Appendix A. Moment calculations 

In this appendix we establish Theorem [3l Our main tool is the Leibniz expansion 

detM„ = ^ 

where for each permutation cr : {1, . . . , n} — >■ {1, . . . , n}, /o- is the random variable 

n 

la ■■= Sgn(cr) Jl Ci<7(») ■ 
i=l 

We begin with the first moment computation. Clearly 

EdetM„ = ^ E/^. 

Because all the have mean zero and are jointly independent on the upper tri- 
angular region 1 < i < j, we see that E/o- vanishes unless a consists entirely of 
2-cycles (i.e. is a perfect matching), in which case E/o- = 1. Thus, Edet7\f„ is the 
number of perfect matchings on {1, . . . , n}, which is easily seen to be zero when n 
is odd and , ,„?,'„„ /o when n is even. 

Now we turn to the second moment computation for GOE, thus we seek the bounds 

(50) n^/^n! < E| det M„p < n^/'^nl. 
We may of course assume that n is large. 

From the Leibniz expansion we have 

(51) E|detM„|2= ^I^Tp- 

Actually, as GOE has real coefficients, we can omit the complex conjugate over the 
Ip term. 

Now we investigate the expressions EI^Ip. This expression can be estimated using 
the cycle decomposition of a and p. It is not difficult to see that this expression 
will be zero unless the following conditions are satisfied: 

• If 7 is a cycle in a of length other than two, then either 7 or its reversal 
7^^ is a cycle in p, and conversely. 
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• The support of the 2-cycles in a equals the support of the 2-cycles in p. 

Furthermore, if the above conditions are satisfied, then EI„Ip is equal to 2^^3'^, 
where Ci is the number of 1-cycles of a (or of p), and c is the number of 2-cycles that 
are common to both a and p. This comes from the fact that the diagonal entries 
of GOE have variance 2, while the off-diagonal entries have a fourth moment of 3. 

Some elementary combinatorics shows that for a given permutation ct, the number 
of permutations p obeying the above conditions is equal to 

C2!2C2li^ ' 

fe>3 

where Ck is the number of fc-cycles of a. (To be more precise, we would have to 
write Ck{(j)-i but as there is little chance of misunderstanding, we prefer using just 
Ck to simplify the presentation.) Thus ([5T|) is thus lower bounded by 

crGSn /c>3 



In the converse direction, for fixed < c < C2, the number of p obeying the above 
conditions and with exactly c 2-cycles in common is bounded by 

C,\ 2(C2-C)! yr 



and so (|51|) is upper bounded by 

Let us first estimate the upper bound. Observe from Stirling's formula that 



(C2-c)!2C2 
This and an elementary calculation show 

C2 



So we may upper bound ([5T|) by 



Using the fact that X]m=i = 0{\/n), we see that to prove the upper bound in 
(l50t. it suffices to show that 



(53) n < 

for each < m < n/2. 
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To establish ([53|) . we use a double counting argumen10 as follows. For each 
permutation cr with exactly m 2-cycles, we assign a quantity F[a) which is the 
product of the number of ways to write down the 2-cycles of a (counting ordering) 
and the number of ways to select some union E of the /c-cycles of a with fc 7^ 2. 

If the 2-cycles of a are (xij/i), . . . , {xmUm), then there are m!2'" ways to write 
them down (counting all permutations in Sm and the permutations between Xj and 
Uj). Furthermore, there are 0^742 2'^'° ways to select E, which is a cr-invariant set 
disjoint from the xi, . . . ^ Xm, ■ • • , Um- This set E has some cardinality j between 
and n — 2m. Therefore, 



On the other hand, there are („_"2m)! '^^Y^ ^o select 2m ordered elements xi, . . . , Xm , yi , 
of {1, . . . , n}. For each j, there are then ^"~2m~j -^^ays to select E, and then to spec- 
ify a on E and on the complement of ii^ U {xi , . . . , Xm , J/i j • • • , 2/™} there are at most 
j!(n — 2m — j)! possibilities. (Notice that a restricted to i? is a permutation on E.) 
Putting all this together, we may bound the left-hand side of ([55]) by 



n — 2m ^ 



and the claim follows 



(n — 2m)! V 7 



Now we turn to the lower bound. From Stirling's formula we have 

2C2! 



C2!2C^= 



» C2!2^VVC2 + 1 



so by (|52|) (and crudely bounding ^/C2 + 1 by 0{^/n) it suffices to show that 

C2!2C'^n2'''»"'x"!- 

For this, it suffices to prove the matching lower bound 
(54) J2 "^•2" Y[2'^'' ^nxnl 

u£S„-C-2=m k^2 

to ([55)1 for each < to < n/4 (say). 

We use the same double-counting argument as before. We write the left-hand 
side of (|5^ as X^o-es ■c^^m^i'^)- We use the classical fact that as n 00, the 
random variables Ci, . . . , Cfc for any fixed k converge jointly to independent Poisson 
variables of intensities 1/1, . . . , 1/fc respectively (see e.g. [1]), so a positive constant 
fraction of Sn is 2-cycle free for n > 0. After fixing xi, . . . , a;„i, yi, . . . , j/m, i/, 



^'^One could in fact obtain much more precise asymptotics on II53I I using the method of gener- 
ating functions, but we will not need to do so here. 
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notice that any 2-cycle free permutation on E and on its complement will give a 
contribution to ()54|) . Thus, we obtain a lower bound of the form 

^ („ _ 2my. [ J )jKn-2m-j)\^nl{n-2m+l), 

concluding the proof. 

Now we consider the second moment for the GUE case. There are three differences 
here. Firstly, the factor of 2*-^^ that was present in the GOE analysis (which arose 
from the fact that the diagonal entries had variance 2 instead of 1) is now absent. 
Secondly (and most importantly), in order for E/o-Zp to be non- vanishing, each 
cycle 7 of length at least three in a must appear in p also; the appearance of the 
inverse cycle 7"^ now leads to cancellation, in contrast with the GOE case. As 
such, the 2'^'= factors for k > 3 are also absent. Finally, the factor of 3"^ in the above 
analysis becomes 2"^, due to the smaller value of the fourth moment ElCyl"* of the 
off-diagonal entries in the GUE case. Repeating the GOE arguments, one reduces 
to showing that 

(y^Sn •■C^—ra 

for all < TO < n/2, and 

to!2" » n\ 

for all < TO < n/4. But this can be achieved by a routine modification of the above 
arguments (with the role of the additional set i?, which represented the nfc5^2 2*^'' 
factor, now omitted). 

Remark 17. An inspection of the above argument shows that the hypotheses that 
M„ are distributed according to GOE or GUE can be relaxed to the assertion that 
Mn matches GOE or GUE to fourth order off the diagonal and to second order on 
the diagonal. 
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